- Home
- Search Results
- Page 1 of 1
Search for: All records
-
Total Resources3
- Resource Type
-
0002100000000000
- More
- Availability
-
30
- Author / Contributor
- Filter by Author / Creator
-
-
Huang, Ruizhe (3)
-
Khudanpur, Sanjeev (3)
-
Povey, Dan (3)
-
Trmal, Jan (3)
-
Ehlen, Patrick (2)
-
Liu, Jing (2)
-
Raj, Desh (2)
-
Yarmohammadi, Mahsa (2)
-
Yu, Mingzhi (2)
-
Garcia, Leibny P (1)
-
Garcia-Perera, Leibny Paola (1)
-
Ivanov, Alexei (1)
-
Ivanov, Alexei V (1)
-
Paola_Garcia, Leibny (1)
-
Wiesner, Matthew (1)
-
#Tyler Phillips, Kenneth E. (0)
-
#Willis, Ciara (0)
-
& Abreu-Ramos, E. D. (0)
-
& Abramson, C. I. (0)
-
& Abreu-Ramos, E. D. (0)
-
- Filter by Editor
-
-
Calzolari, Nicoletta (1)
-
Hoste, Veronique (1)
-
Kan, Min-Yen (1)
-
Lenci, Alessandro (1)
-
Sakti, Sakriani (1)
-
Xue, Nianwen (1)
-
& Spizer, S. M. (0)
-
& . Spizer, S. (0)
-
& Ahn, J. (0)
-
& Bateiha, S. (0)
-
& Bosch, N. (0)
-
& Brennan K. (0)
-
& Brennan, K. (0)
-
& Chen, B. (0)
-
& Chen, Bodong (0)
-
& Drown, S. (0)
-
& Ferretti, F. (0)
-
& Higgins, A. (0)
-
& J. Peters (0)
-
& Kali, Y. (0)
-
-
Have feedback or suggestions for a way to improve these results?
!
Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher.
Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?
Some links on this page may take you to non-federal websites. Their policies may differ from this site.
-
ConEC: Earnings Call Dataset with Real-world Contexts for Benchmarking Contextual Speech RecognitionCalzolari, Nicoletta; Kan, Min-Yen; Hoste, Veronique; Lenci, Alessandro; Sakti, Sakriani; Xue, Nianwen (Ed.)Knowing the particular context associated with a conversation can help improving the performance of an automatic speech recognition (ASR) system. For example, if we are provided with a list of in-context words or phrases — such as the speaker’s contacts or recent song playlists — during inference, we can bias the recognition process towards this list. There are many works addressing contextual ASR; however, there is few publicly available real benchmark for evaluation, making it difficult to compare different solutions. To this end, we provide a corpus (“ConEC”) and baselines to evaluate contextual ASR approaches, grounded on real-world applications. The ConEC corpus is based on public-domain earnings calls (ECs) and associated supplementary materials, such as presentation slides, earnings news release as well as a list of meeting participants’ names and affiliations. We demonstrate that such real contexts are noisier than artificially synthesized contexts that contain the ground truth, yet they still make great room for future improvement of contextual ASR technology.more » « less
-
ConEC: Earnings Call Dataset with Real-world Contexts for Benchmarking Contextual Speech RecognitionHuang, Ruizhe; Yarmohammadi, Mahsa; Trmal, Jan; Liu, Jing; Raj, Desh; Garcia, Leibny P; Ivanov, Alexei; Ehlen, Patrick; Yu, Mingzhi; Povey, Dan; et al (, ELRA and ICCL)Knowing the particular context associated with a conversation can help improving the performance of an automatic speech recognition (ASR) system. For example, if we are provided with a list of in-context words or phrases — such as the speaker’s contacts or recent song playlists — during inference, we can bias the recognition process towards this list. There are many works addressing contextual ASR; however, there is few publicly available real benchmark for evaluation, making it difficult to compare different solutions. To this end, we provide a corpus (“ConEC”) and baselines to evaluate contextual ASR approaches, grounded on real-world applications. The ConEC corpus is based on public-domain earnings calls (ECs) and associated supplementary materials, such as presentation slides, earnings news release as well as a list of meeting participants’ names and affiliations. We demonstrate that such real contexts are noisier than artificially synthesized contexts that contain the ground truth, yet they still make great room for future improvement of contextual ASR technologymore » « less
-
Huang, Ruizhe; Wiesner, Matthew; Garcia-Perera, Leibny Paola; Povey, Dan; Trmal, Jan; Khudanpur, Sanjeev (, CASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP))
An official website of the United States government

Full Text Available